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Abstract 

■ The recognition problem for attribute- value grammars (AVGs) was shown 



to be undecidable by Johnson in 1988. Therefore, the general form of 
^) ■ AVGs is of no practical use. In this paper we study a very restricted form 

of AVG, for which the recognition problem is decidable (though still NP- 
■ complete), the R-AVG. We show that the R-AVG formalism captures all 

of the context free languages and more, and introduce a variation on the 
bj}' so-called off-line parsability constraint, the honest parsability constraint, 

which lets different types of R-AVG coincide precisely with well-known 
Oh 1 time complexity classes. 
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1 Introduction 



- *— i i 

Although a universal feature theory does not exist, there is a general under- 
standing of its objects. The objects of feature theories are abstract linguistic 
objects, e.g., an object "sentence," an object "masculine third person singu- 
lar," an object "verb," an object "noun phrase." These abstract objects have 
properties like "tense," "number," "predicate," "subject." The values of these 
properties are either atomic, like "present" and "singular," or abstract objects, 
like "verb" and "noun-phrase." The abstract objects are fully described by their 
properties and their values. Multiple descriptions for the properties and values 
of the abstract linguistic objects are presented in the literature. Examples are: 

1. Feature graphs, which are labeled rooted directed acyclic graphs G = 
(V, A), where F is a collection of labels, a sink in the graph represents an 
atomic value and the labeling function is an injective function / : VxAm 
F. 



*The author was supported in part by HC&M grant ERB4050PL93-0516. 
^The author was supported by the Foundation for language, speech and logic (TSL), which 
is funded by the Netherlands organization for scientific research (NWO) 



1 



Version Mar 7, 1995; LaTeXed on February 4, 2008 



2 



2. Attribute- value matrices, which are matrices in which the entries consist 
of an attribute and a value or a reentrance symbol. The values are either 
atomic or attribute- value matrices. 

From a computational point of view, all descriptions that are used in practi- 
cal problems are equivalent. Though there exist some theories with a consider- 
ably higher expressive power | JBS95 ]. For this paper we adopt the feature graph 



description, which we will define somewhat more formal in the next section. 
Attribute Value Languages(AVL) [ pmo92| ] consist of sets of logical formulas 
that describe classes of feature graphs, by expressing constraints on the type 
of paths that can exist within the graphs. To wit: In a sentence like "a man 
walks" the edges labeled with "person" that leave the nodes labeled "a man" 
and "walks" should both end in a node labeled "singular." Such a constraint 
is called a "path equation" in the attribute-value language. 

A rewrite grammar |Cho56| can be enriched with an AVL to construct an 
Attribute Value Grammar (AVG), which consists of pairs of rewrite-rules and 
logical formulas. The rewrite rule is applicable to a production (nonterminal) 
only if the logical formula that expresses the relation between left- and right- 
hand side of the rule evaluates to true. The recognition problem for attribute- 
value grammars can be stated as: Given a grammar G and a string w does there 
exist a derivation in G, that respects the constraints given by its AVL, and that 
ends in w. As the intermediate productions correspond to feature graphs this 
question can also be formulated as a question about the existence of a consistent 
sequence of feature graphs that results in a feature graph describing w. For the 
rewrite grammar, any formalism in the Chomsky hierarchy (from regular to 
type 0) can be chosen. From a computational point of view it is of course most 
desirable to restrict oneself to a formalism that on the one hand gives enough 
expressibility to describe a large fragment of the (natural) language, and on the 
other hand is restrictive enough to preserve feasibility. For a discussion on the 



linguistic significance of such restrictions, see | Per84 |. 



Johnson [ |Joh88 ] proved that attribute- value grammars that are as restrictive 



as being equipped with a rewrite grammar that is regular can already give rise 
to an undecidable recognition problem. Obviously, to be of any practical use, 
the rewrite grammar or the attribute-value language must be more restrictive. 
Johnson proposed to add the off-line parsability constraint, which is respected 
if the rewrite grammar has no chain- or e-rules. Then, the number of applica- 
tions in a production is linear and the size of the structure corresponding to the 
partial productions is polynomial. Hence as by a modification of Smolka's al- 
gorithm [ |5mo92 | consistency of intermediate steps can be checked in quadratic 



time, the complexity of the recognition problem can at most be (nondetermin- 



istic) polynomial time. This observation was made in [Tra95], which also has 
an ./VP-hardness proof of the recognition problem. 

We further investigate the properties of these restricted AVGs (R-AVGs). 
In the next section, we give some more formal definitions and notations. In 
Section |H we show that the class of languages generated by an R-AVG (R-AVGL) 
includes the class of context free languages (CFL). It follows that any easily 
parsable class of languages (like CFL) is a proper subset of R-AVGL, unless 
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P = NP. Likewise, R-AVGL is a proper subset of the class of context sensitive 
languages, unless NP = PSPACE. In Section ||] we propose a further refinement 
on the off-line parsability constraint, which allows R-AVGs that respect this 
constraint to capture precisely complexity classes like NP or NEXP. That is, 
for any language L that has an jVP-parser, there exists an R-AVG, say G, such 
that L = L(G). Though our refinement, the honest parsability constraint is 
probably not a property that can be decided for arbitrary R-AVGs, we show 
that R-AVGs can be equipped with restricting mechanisms that enforce this 



property. The techniques that prove Theorem |3.1| and Theorem |4.2| result from 
Johnson's work. Therefore, the proofs of these theorems are deferred to the 
appendices. 

2 Definitions and Notation 

2.1 Attribute- Value Grammars 



The definitions in this section are in the spirit of |Ioh88 , Section 3.2] and [3mo92, 



Sections 3-4]. Consider three sets of pairwise disjoint symbols. 
A, the finite set of constants, denoted (a, b, c, . . .) 
V, the countable set of variables, denoted (x, y,z, . . .) 
L, the finite set of attributes, also called features, denoted (f,g, h, . 



Definition 1: An f-edge from x to s is a triple (x,f,s) such that x is a 
variable, / is an attribute, and s is a constant or a variable. A path, p, is a, 
possibly empty, sequence of /-edges {x\, /i, x 2 ), (x 2 , / 2 , x 3 ), . . . , (x n , /„, s) 
in which the X{ are variables and s is either a variable or a constant. Often 
a path is denoted by the sequence of its edges' attributes, in reversed order, 
e.g., p = f n . . . f\. Let p be a path, ps denotes the path that starts from s, 
where s is a constant only if p is the empty path. If the path is nonempty, 
p = f n ■ ■ ■ fi (ngeql), then s is a variable. For paths ps and qt we write 
ps = qt iff p and q start in s and t respectively and end in the same variable 
or constant. The expression ps = qs is called a path equation. A feature 
graph is either a pair (a, 0), or a pair (x,E) where x is the root and E a 
finite set of /-edges such that: 

1. if (y, /, s) and (y, /, t) are in E, then s = t; 

2. if (y, /, s) is in E, then there is a path from x to y in E. 



Definition 2: An attribute-value language A(A, V, L) consists of sets of log- 
ical formulas that describe feature graphs, by expressing constraints on the 
type of paths that can exist within the graphs. 

• The terms of an attribute-value language A(A, V, L) are the constants 
and the variables s, t £ ADV. 
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• The formulas of an attribute-value language A(A, V, L) are path equa- 
tions and Boolean combinations of path equations. Thus all formulas 
are either ps = qt, where ps and qt are paths, or (ft A ip, (ft V ip, or -xft, 
where (ft and ift are formulas. 

Assume a finite set Lex (of lexical forms) and a finite set Cat (of categories). 
Lex will play the role of the set of terminals and Cat will play the role of the 
set of nonterminals in the productions. 

Definition 3: A constituent structure tree (CST) is a labeled tree in which 
the internal nodes are labeled with elements of Cat and the leaves are 
labeled with elements of Lex. 



Definition 4: Let T be a constituent structure tree and F be a set of for- 
mulas in an attribute- value language A(A, V, L). An annotated constituent 
structure tree is a triple <T, F, h>, where h is a function that maps internal 
nodes in T onto variables in F. 



Definition 5: A lexicon is a finite subset of Lex x Cat xA(A, {xo}, L). A set 
of syntactic rules is a finite subset of Ui>i Cat x Cat* x_4.(t4, {xo, • • • , L). 
An attribute-value grammar is a triple <lexicon, rules, start >, where lexi- 
con is a lexicon, rules is a set of syntactic rules and start is an element of 
Cat. 



Definition 6: 

1. [BDG88, p .150] A class C of sets is recursively presentable iff 
there is an effective enumeration Mi , M2 , . . . of deterministic Turing 
machines which halt on all their inputs, and such that C = {L(Mi) \ 
i = 1,2,...}. 

2. We say that a class of grammars Q is recursively presentable iff the 
class of sets {L(G) \ G G Q} is recursively presentable. 



2.2 Restricted Attribute- Value Grammars 

The only formulas that are allowed in the attribute- value language of restricted 
attribute-value grammars (R-AVGs) are path-equations and conjunctions of 
path-equations (i.e. disjunctions and negations are out). We will denote the 
attribute- value language of an R-AVG by A'(A,V,L) to make the distinction 
clear. The CST of an R-AVG is produced by a chain- and e-rule free regular 
grammar. The CST of an R-AVG can be either a left-branching or a right- 
branching tree, since the grammar contains at most one nonterminal in each 
rule. 
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Definition 7: The set of syntactic rules of a restricted attribute-value gram- 
mar is a subset of Ui>i k<i Cat x Lex* x Cat fc x A' (A, {xo, x^}, L). A re- 
stricted attribute-value grammar is a pair <rules, start >, where rules is a 
set of syntactic rules and start is an element of Cat. 



Definition 8: An R-AVG <rules, start> generates an annotated constituent 
structure tree <T, F, h> iff 

1. the root node of T is start, and 

2. every internal node of T is licensed by a syntactic rule, and 

3. the set F is consistent, i.e., describes a feature graph. 

Let 4>[x/y] stand for the formula (ft in which all variable y is substituted for 
variable x. An internal node v of an annotated constituent structure tree 
is licensed by a syntactic rule (co, Zi, . . . , k, (f>) iff 

1. the node v is labeled with category cq, h(v) = no, and 

2. all daughters of v are leaves, which are labeled with l\ . . . ij, and 

3. 0[xo/no] is in the set F. 

An internal node v of an annotated constituent structure tree is licensed 
by a syntactic rule (co, h, . . . , h, ci, (f)) iff 

1. the node v is labeled with category co, h(v) = no, and 

2. one of v J s daughters is an internal node, v\, which is labeled with 
category c±, and h(v\) = n\, and 

3. the daughters of v that are leaves are labeled with l\ . . . li, and 

4. cp[xo/nQ,xi/ni] is in the set F. 



3 Weak Generative Capacity 

In |fga9l , it is shown that the recognition problem for R-AVGs is iVP-complete. 
This seems to indicate that although the mechanism for generating CSTs in R- 
AVGs is extremely simple, the generative capacity of R-AVGs is different from 
the generative capacity of e.g., context free languages (CFLs), which have a 
polynomial time parsing algorithm [ Ear70 |. Yet, a priori, there may exist CFLs 
that do not have an R-AVG. 



Theorem 3.1 Let L be a context free language. There exists an R-AVG G 
such that L = L(G). 

Proof. If L is a context free language, then there exists a context free grammar 
G' in Greibach normal form such that L = L(G'). From this grammar G' , we 
can construct a pushdown store M that accepts exactly the words in L{G') = L. 
Such a pushdown store M is actually a finite state automaton M' with a stack 
S. The finite state automaton M' may be simulated by a chain- and e-rule free 
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regular grammar. Furthermore, we can construct an attribute- value language 
A' (A, V, L) that simulates the stack S. Thus it should be clear that there exists 
an R-AVG G that produces word w iff w G L{G'). Details of this construction 
are deferred to Appendix [A|. □ 

From this we can draw the conclusion that the class of context free languages 
is indeed a proper subset of the class of R-AVG languages, unless P = NP. 

Theorem 3.2 Let C be a recursively presentable class of grammars such that: 

1. G G C can be decided in time polynomial in \G\ 

2. G => w can be decided in time polynomial in \G\ + \ w\. 

If every R-AVG G has a grammar in C then P = NP. In fact, for every 
language L in NP there is an explicit deterministic polynomial time algorithm. 



Proof. Let L be a language in NP and w G {0,1}*. Trautwein [Tra95] pro- 
vided an R-AVG G and a reduction that maps any formula F onto a string 
wf s.t. G =£• wf iff F G SAT. It was also shown that any R-AVG has a 
nondeterministic polynomial time, hence deterministic exponential time, recog- 
nition algorithm. Suppose every R-AVG G has a grammar in C. Then there 
exists a G' G C with L(G') = L(G). We can decide in polynomial time whether 
w F G L(G) for any w F - So, P = NP. 

If every R-AVG G has a grammar in C, then the algorithm for deciding u w G 
L?" consists of: use Cook's reduction to produce a formula F that is satisfiable 
iff w G L; use Trautwein's reduction to produce wp and R-AVG G; enumerate 
grammars in C for the first grammar G' that has a description of length less 
than log logH for which L(G)n{0, 1}< 1o s 1o sM = L(G')n{0, l}< lo g lo sM accept 
iff w G L{G'). This gives a polynomial time algorithm that erroneously accepts 
or rejects w for only a finite number of strings w. The theorem now follows 
from the fact that both P and NP are closed under finite variation. □ 



Corollary 3.3 If R-AVGs generate only context free languages then P = NP. 

In fact it can be shown directly that R-AVGs also produce non-context free 
languages. 

Theorem 3.4 The context sensitive language {a n b n c n } is generated by an R- 
AVG. 

Proof. (Sketch) Typically, the R-AVG that generates the language {a n b n c n } first 
generates an amount of a's then an amount of 6's and finally an amount of c's. 
Let us assume that the grammar generates i a's. During the derivation, the 
feature graph can be used to store the amount of a's that is produced. Once 
the grammar starts to produce 6's , the feature graph will force the grammar 
to generate exactly i 6's and next to generate exactly i c's as well. □ 
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4 The Honest Parsability Constraint and Consequences 



According to Theorem |3,2| , it is unlikely that the languages generated by R- 
AVGs can be limited to those languages with a polynomial time recognition 



algorithm. Trautwein | Tra95 1 showed that all R-AVGs have nondeterministic 
polynomial time algorithms. Is it perhaps the case that any language that has 
a nondeterministic polynomial time recognition algorithm can be generated by 
an R-AVG. Does there exist a tight relation between time bounded machines 
and R-AVGs as e.g., between LB As and CSLs? The answer is that the off-line 
parsability constraint that forces the R-AVG to have no chain- or e-rules is just 
too restrictive to allow such a connection. The following trick to alleviate this 
problem has been observed earlier in complexity theory. The off-line parsability 
constraint(OLP) [ poh88| relates the amount of "work" done by the grammar 
to produce a string linearly to the number of terminal symbols produced. It is 
therefore a sort of honesty constraint that is also demanded of functions that 
are used in e.g., cryptography. There the deal is, for each polynomial amount of 
work done to compute the function at least one bit of output must be produced. 
In such a way, for polynomial time computable functions one can guarantee that 
the inverse of the function is computable in nondeterministic polynomial time. 

As a more liberal constraint on R-AVGs we propose an analogous variation 
on the OLP 

Definition 9: A grammar G satisfies the Honest Parsability Constraint(HPC) 
iff there exists a polynomial p s.t. for each w in L(G) there exists a deriva- 
tion with at most steps. 

From Smolka's algorithm and Trautwein's observation it trivially follows 
that any attribute-value grammar that satisfies the HPC (HP-AVG) has an NP 
recognition algorithm. The problem with the HPC is of course that it is not a 
syntactic property of grammars. The question whether a given AVG satisfies 
the HPC (or the OLP for that matter) may well be undecidable. Nonetheless, 
we can produce a set of rules that, when added to an attribute- value grammar 
enforces the HPC. The newly produced language is then a subset of the old 
produced language with an NP recognition algorithm. Because of the fact that 
our addition may simulate any polynomial restriction, we regain the full class 
of AVG's that satisfy the HPC. In fact 

Theorem 4.1 The class, P-AVGL, of languages produced by the HP-AVGs is 

recursively presentable. 

We will give a detailed construction of such a set of rules in Appendix [B]. 
The existence of such a set of rules and the work of Johnson now gives the 
following theorem. 

Theorem 4.2 For any language L that has an NP recognition algorithm, there 
exists a restricted attribute-value grammar G that respects the HPC and such 
that L = L(G). 
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Proof. (Sketch) Let M be the Turing machine that decides w £ L. Use a vari- 
ation of Johnson's construction of a Turing machine to create an R-AVG that 
can produce any string w that is recognized by M. Add the set of rules that 
guarantee that only strings that can be produced with a polynomial number of 
rules can be produced by the grammar. □ 



5 Veer out the HPC 

Instead of creating a counter of logarithmic size as we do in Appendix |^, it is 
quite straightforward to construct a counter of linear size (or exponential size 
if there is enough time). In fact, for well-behaved functions, the construction 
of a counter gives a method to enforce any desired time bound constraint on 
the recognition problem for attribute- value grammars. For instance, for nonde- 
terministic exponential time we could define the Linear Dishonest Parsability 
Constraint (LDP) (allowing a linear exponential number of steps) which would 
give. 

Theorem 5.1 The class of languages generated by R-AVGs obeying the LDP 
condition is exactly NE. 
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A Simulating a Context Free Grammar in GNF 

A context free grammar (CFG) is a quadruple (N, S, P, S), where ./V is a set of 
nonterminals, £ is a set of terminals, P is a set of productions, and S S N is the 
start nonterminal. A CFG is in Greibach normalform (GNF) if, and only if, the 
productions are of one of the following forms, where a E S, A G N, A\ . . . A n G 
N \ {S} and e the empty string (c.f., fHU7|, §ud8|): 



A — > aA\...A n 
A -> a 
S -» e 



Given a GNF G = (N, S, P, S), we can construct a restricted attribute- value 
grammar (R-AVG) G' that simulates grammar G. R-AVG G' consists of the 
same set of nonterminals and terminals as GNF G. The productions of R-AVG 
G' are described by Table |l[ The only two attributes of R-AVG G' are top 
and rest. R-AVG G' contains \N\ + 1 atomic values, one atomic value for each 
nonterminal and the special atomic value $. The R-AVG G' uses the feature 
graph to encode a push-down stack, similar to the encoding of a list. The stack 
will be used to store the nonterminals that still have to be rewritten. 

The three syntactic abbreviations below are used to clarify the simulation. 
We use represent a stack by a Greek letter, or a string of symbols; the top 
of the stack is the leftmost symbol of the string. Let xq encode a stack 7, 
then the formulas in the abbreviation push(^4o • • • A n ) express that x\ encodes 
a stack Aq . . . A n ^. Likewise, the formulas in the abbreviation pop(j4) express 
that xq encodes a stack Aj, and X\ encodes the stack 7. The abbreviation 
empty-stack expresses that xq encodes an empty stack. 

push(A . . . A n ) stands for top(xi) = Aq /\ 

top rest(xi) = A\ A 

TOP REST n (xi) = An A 
REST n+1 (xi) = Xq 
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POP(A) stands for top(x ) = A A 
rest(xo) = Xi 
EMPTY-STACK stands for x = $ 



Productions of GNF G 


Productions of R-AVG G' 




S — > aA 1 ...A n 


~> S -> aAi 






PUSH(A.2 . . . A„) A EMPTY-STACK 




A aAx ...A n 


A — ► aAi 






PUSH(A2 . ..An) 


(A^S) 


S — > a 


~> S ^aB 


VB s N \ {S} 




POP(B) A EMPTY-STACK 




S^a 


~» 5 — ► a 






EMPTY-STACK 




A^ a 


~> A — ► a_B 


VS € N \ {S} 




POP(B) 


(A^S) 


A^ a 


~> A — > a 






EMPTY-STACK 






neglected 





Table 1: Simulating productions of GNF G by R-AVG G' 



We have to prove that GNF G and its simulation by R-AVG G' generate 
(almost) the same language. Obviously, R-AVG G' cannot generate the empty 
string. However, for all non-empty strings the following theorem holds. 

Theorem A.l Start nonterminal S of GNF G derives string a (a G S + j if, 
and only if, start nonterminal S of R-AVG G' derives string a with the empty 
stack. 

Proof. There are two cases to consider. First, S derives string a in one step. 
Second, S derives string a in more than one step. The lemma below is needed 
in the proof of the second case. 

Case I Let start nonterminal S derive string a in one step. GNF G contains 
a production S — > a iff R-AVG G' contains a production S — > a with the 
equation empty-stack. So, S derives a in a derivation of GNF G iff S 
derives a with an empty stack in the derivation of R-AVG G' . 

Case II Initial nonterminal S of GNF G derives string a = (3(3' in more than 
one step iff there is a left-most derivation S 4> (3 A (3(3'. GNF G 
contains production A — > (3' iff R-AVG G' contains production A — > (3' 
with the equation empty-stack. By the next lemma: S (3A iff S 
(3 A with the empty stack. Hence S derives a for GNF G iff S derives a 
with empty stack for R-AVG G'. ^ 

Lemma A. 2 Start nonterminal S derives aAj (a G £+,^17 € (N\{S}) + ) in 
a left-most derivation of GNF G if, and only if, nonterminal S derives aA with 
stack 7$ ($ is the bottom- of- stack symbol) in the derivation of R-AVG G' . 
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Proof. The lemma is proven by induction on the length of the derivation. 

Basis If S derives a Ay in one step, then GNF G contains production S — > aA'y 
and R-AVG G' contains production S — ► a A with stack 7$. If S derives 
aA with stack 7$ in one step, then R-AVG G' contains production S — » 
aA with stack 7$ and GNF G contains production S — > aA'y. 

Induction The induction hypotheses states that S =5* aA'y in GNF G iff S 
a^4 with stack 7$ in R-AVG G' . Next, we distinguish three cases. 

1. GNF G contains a production A — > aA\A2 ■ ■ ■ A n . Hence there is 
a left-most derivation S =4> aaA\A2 ■ ■ ■ A n y. GNF G contains the 
production ^4 — > aA\Ai . . . A n iff R-AVG G' contains a production 
A — > a^4i with equation push(^42 • • -A n ). Since the induction hy- 
potheses states that there is a derivation S => aA with stack 7$, 
there is a derivation S aaA\ with stack A2 ■ ■ ■ A n y$. 

2. GNF G contains a production A — > a and 7 = S't 7 . Hence there is a 
left-most derivation S aaB'y'. GNF G contains the production 
^4 — > a iff R-AVG G' contains productions A — > a-B with equation 
pop(B), for all .£? £ iV \ {5 1 }. Hence by the induction hypotheses, 
there is a derivation S aaB' with stack 7'$. 

3. GNF G contains a production A — > a and 7 = e. Then there is a left- 
most derivation S aa. GNF G contains the production A — > a iff 
R-AVG G" contains production A —> a with equation empty- STACK. 
Hence by the induction hypotheses, there is a derivation S => aa 
with stack $. U 

Because every context free language is generated by some GNF G, every 
context free language is generated by some R-AVG G'. 



B Constructing an Honestly Parsable Attribute- Value 
Grammar 

In this section we show how to add a binary counter to an attribute- value gram- 
mar (AVG). This counter enforces the Honest-Parsability Constraint (HPC) 
upon the AVG. To keep this section legible we sometimes use the attribute- 



value matrices (AVMs) as descriptions. In Section B.2, we show how to create 



a counter for the AVG. In Section B.3 we show how to extend the syntactic 
rules and the lexicon of the AVG. 



B.l Arithmetic by AVGs 

We start with a little bit of arithmetic. 
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Natural numbers. The AVMs below encode natural numbers in binary no- 
tation. The sequences of attributes and 1 in these AVMs encode natural 
numbers, from least- to most-significant bit. The attribute v has value 1 (or 0) 
if, and only if, it has a sister attribute 1 (or 0). 



1. The AVMs 
one. 



v 
+ 



and 



v 1 
1 + 



2. The AVMs 



v 
[F] 

[F] encodes a natura" 



and 



v 1 
1 [F] 
number. 



encode the natural numbers zero and 



encode natural numbers iff the AVM 



minal A with some AVM 



where [F] and [H] encode natural number 



Syntactic rules that tests two numbers for equality. Assume a nonter- 

n [F] 
m [H] 

x and y, respectively. We present one syntactic rule that derives from this 

n [F] 
u [H] 



nonterminal A a nonterminal B with AVM 



if x = y. 



A-+ B 
n(xo) = m(x ) 
Aid = x\ 



Table 2: The rule to test two numbers for equality. 

Clearly, this simple test takes one step. A more sophisticated test, which 
also tests for inequality, would compare [F] and [G] bit-by-bit. Such a test 
would take 0(min(log(x), log(y))) = 0(min(|[F]|, |[-H]|)) derivation steps. 



Syntactic rules that multiply by two. Assume a nonterminal A with 



some AVM 
syntactic ru 



N [F] , where [F] encodes natural number x. We present one 



e that derives from this nonterminal A a nonterminal B with the 



AVM 



N [H] 



, where [H] encodes natural number 2x. 
The number N in [H] equals two times N in [F] if, and only if, the least- 
significant bit of N in [H] is 0, and the remaining bits form the same sequence 
as the number N in [F]. Multiplication by two takes one derivation step. 



A-> B 




v n(xi) 


= 


AN(l(i) 


= n(xi) 



Table 3: The rule to multiply by two. 



Syntactic rules that increments by one. Assume a nonterminal A with 
some AVM N [F] , where [F] encodes natural number x. We present five 
syntactic rules that derive from this nonterminal A a nonterminal C with AVM 



N 



[H] , where [H] encodes natural number x + 1. 
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The increment of N requires two additional pointers in the AVM of A: at- 
tribute P points to the next bit that has to be incremented; attribute Q points to 
the most-significant bit of the (intermediate) result. These additional pointers 
are hidden from the AVMs of the nonterminals A and C. 

The five rules from Table || increment N by one. Nonterminal A rewrites, in 
one or more steps, to nonterminal C, potentially through a number of nonter- 
minals B. 



A'—> C 

v N(a; ) == 

A n(xq) 1 n(xi) 

A V N(sci) = 1 


B 

v n(x ) = 1 
A 1 n(x ) = P(a;i) 
A n(xi) = Q(a;i) 
A v n(xi) = 


B^ B 
v p(x ) = 1 
A 1 p(x ) = p(xi) 
An(xo) = n(xi) 
A v q(xo) = 
AO Q(a;o) = Q(xi) 


B— » C 
v p(x ) = 
Av Q(x ) = 1 
A p(x ) = 1 Q(a;o) 
An(xo) = n(xi) 


B^ C 
v p(xo) = 1 
A 1 P(x ) = + 
An(io) = n(xi) 
A v Q(x ) = 
A v Q(x Q ) = 1 
A 1 Q(x ) = + 





Table 4: Five rules to increment N by one. 



The first and fourth rule of Table ^| state that adding one to a zero bit 
sets this bit to one and ends the increment. The second and third rule state 
that adding one to a one bit sets this bit to zero and the increment continues. 
The fifth rule states that adding one to the most-significant bit sets this bit to 
zero and yields a new most-significant one bit. We claim that A C takes 
0(\og(x)) = 0(\[F]\) derivation steps. 

Rules, similar to the ones above, can be given that decrement the attribute 
N by one. We only have to take a little extra care that the number cannot be 
decremented. 



Syntactic rules that sum two numbers. In this section we use the pre- 
vious test and increment rules (indicated by =). Assume a nonterminal A 



with some AVM 



N 
M 



[F] 
[H] 



, where [F] and [H] encode natural number x and 



y, respectively. We present syntactic rules (Table ||-8|) that derive from this 

where [F'] encodes the 



nonterminal A a nonterminal C with AVM 



N 
M 



[F'] 
[H] 



natural number x + y. 

The increment of N by M is similar to the increment by one. Here, three 
additional pointers are required: the attributes p and Q point to the bits in 
N and M respectively that have to be summed next; attribute R points to the 
most-significant bit of the (intermediate) result. In the addition two states 
are distinguished. In the one state, the carry bit is zero, indicated by nonter- 
minal A' . In the other state, the carry bit is one, indicated by nonterminal 
B. We claim that A 4> C takes 0(max(log(x), log(y))) = 0(max(|[F]|, \[H}\)) 
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A~> A' 




C"^ C 


M(aso) = 




N(as ) = n(xi) 


An(io) 


= p(xi) 


Am(io) = M(ari) 


AM(ii) 


= Q(xi) 




ar(h) 


= N(a;i) 





Table 5: Two rules to hide the auxiliary pointers. 



v p(aio) = 
A v q(x ) = 
A v k(xo) = 
AO P(x ) = p{xi) 
AO Q(s ) = Q(xi) 
A r(x- ) R(a;i) 
An(x ) = N(a5i) 
Allfio) = m(xi) 


A'—> A' 
v p(as ) = 1 
A v Q(x ) = 
A v r(ibo) = 1 
A 1 p(x ) = p(xi) 
AO Q(x ) = q(xi) 
A 1 r(xq) = r(xi) 
An(i ) = n(xi) 
A M(aco) = m(xi) 


A'— > A' 
v p(z ) = 
Av q(x ) = 1 
A v r(xo) = 1 
AO P(x ) = p(xi) 
A 1 q(x q ) = Q(a;i) 
A 1 r(x ) = r(xi) 
An(io) = n(si) 
A u(x ) = M(xi) 


B 

v p(x q ) = 1 
A v q(x ) = 
A v r(x ) = 
A 1 p(a:o) = p(xi) 
AO q(x ) = Q(xi) 
A R(a;o) = R(xi) 
An(ic) = n(ki) 
Am(io) = m(xi) 


B^ B 

v p(x ) = 
Av q(x ) = 1 
A v R.(x ) = 
AO p(x ) = p(a;i) 

A 1 Q(iEo) = Q(xi) 

A r(o;o) = R(a;i) 
An(io) = n(xi) 
AM(io) = m(xi) 


B^ B 

v p(a>o) = 1 
A v q(x ) = 1 
A v r{x ) = 1 
A 1 p(x ) = p{xi) 
A 1 Q(x ) = Q(xi) 
A 1 r(x ) = R(xi) 
An(i ) = n(xi) 
A m(x-o) = m(xi) 



Table 6: Rules when the carry bit is not changed. 



derivation steps. 



Syntactic rules that sum a sequence of numbers. In this section we use 
the previous summation rules (indicated by =). Assume a nonterminal A with 



some AVM 



L [F'] , where [F'j encodes a list of numbers. To wit 



[F'] 



id 



' F [G 2 




R ... 


F [G n ] 






R + 





that derive from this nonterminal A a nonterminal B with AVM 



where [d] encodes natural number X{. We present syntactic rules (Table ||) 

SUML [F] 

_ L [F'] 
where [F] encodes the natural number SjXj. 

The summation requires an additional pointer in the AVM [F']: attribute 
p points to the next element in the list that has to be summed. We claim that 
A =4> B takes 0(Sj log (a:*)) = 0(|[F']|) derivation steps. 
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A'-> B 




B^ A' 




v p(x ) = 


1 


v p(x ) = 





A v q(cco) 


= 1 


A v q(x ) 


= 


A v r(x ) 


= 


Av r(xo) 


= 1 


A 1 p(x q ) 


= p{xi) 


AO P(x ) 


= p(xi) 


A 1 Q(x ) 


= Q(xi) 


AO Q(a;o) 


= Q(xi) 


AO R(a:o) 


= R(a;i) 


A 1 r(xo) 


= R(xi) 


AN(so) = 


n(xi) 


An(i ) = 


n(xi) 


Am(i ) = 


m(xi) 


AM(io) = 


u(xi) 



Table 7: Rules when the carry bit is changed. 



A'-* C 


A'—> C 


A'—> C 


p(x ) = + 


p(x ) = i 


H x o) = + 


A Q(xo) = i 


AQ(xo) = + 


AQ(io) = + 


A R(a;o) = j 


AR.(x ) =j 


Ar(x ) = + 


Ai=j 


M = j 


A xo = x\ 


A xo = xi 


A ao = xi 




B^ C" 


C" 


B^ C 


p(x ) = + 


p(x ) = z 


p(x ) = + 


AQ(x ) = 2 


AQ(xo) = + 


AQ(io) = + 


A r(xo) = z + 1 


A r(xo) = 2 + 1 


A v r(x ) = 1 


A xq = xi 


A xo = xi 


A 1 r(x ) = + 






A xo = xi 



Table 8: Rules that stop the summation. 



B.2 Creating a counter of logarithmic size 

Create an AVM of the following form: 



COUNTER 



SIZE 



M 



POLY 



1 U 




■ [1+] 


V 


1 uo 


1 uo 




' v 1 UO 




[1+] 


V 


1 uo 


1 uo 




v 1 UO 




... [1+] 


1 U 




■ [1 +] ] 



Attribute COUNTER is used to distinguish the AVMs that encodes the counter 
from those in the original attribute-value grammar. We will neglect the at- 
tribute COUNTER in the remainder of this section, because it is not essential 
here. The attributes size, n, m and poly encode natural numbers. The at- 
tribute size records the size of the string that will be generated. The attribute 
POLY records the maximum number of derivation steps that is allowed for a 
string of size size. The attributes N and M are auxiliary numbers. 

The construction of the counter starts with an initiation-step. The further 
construction of the counter consists of cycles of two phases. Each cycle starts 
in nonterminal A. 
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A-^ A' 




A'— » A' 


A'~> B 


v n(xi) = 





SUML(io) = y 


p{x ) = + 


AO n(xi) 


= + 


A F P(a:o) = z 


ASUML(lo) = SUML(li) 


A h(xo) = 


L(a;i) 


A SUMl(h) = y + z 


Al(io) = l(xi) 


A l(x ) = 


p(xi) 


A R p(a;o) = p(xi) 








Ah(x ) = l(xi) 





Table 9: Three rules that sum a list of numbers. 



Initiation step and first phase. The initiation-step sets the numbers SIZE 
and N to 0, and the numbers M and POLY to 1. In the first phase of each cycle, 
the numbers size and N are incremented by 1. 



A 


A^ B 


v siZE(a;i) = 


SlZE(a;o) = x 


A siZE(a;i) = + 


AsiZE(a;i) = x + 1 


A v N(a;i) = 


An(io) = y 


AO n(xi) = + 


An(xi) = y + 1 


A v m(xi) = 1 


A m(xo) = m(xi) 


A 1 m(xi) = + 


APOLY(a:o) = POLY(a;i) 


A 1 polypi) = + 





Table 10: Initiation-step and first phase. 



The second phase of the cycle. In this phase the numbers N and M are 
compared. If N is twice M, then (i) number poly is extended by k bits, (ii) 
number M is doubled, and (in) number N is set to 0. If N is less than twice M, 
nothing happens. 

The left rule of the second phase doubles the number M in the second and 
the third equation. The test "Is N equal to 2m?" therefore reduces to one (the 
first) equation. The fourth equation extend the number POLY with k bits. The 
fifth and sixth equations set the number N to 0. 

The right rule is always applicable. If the right rule is used where the left 
rule was applicable, then the number N will never be equal to 2m in the rest of 
the derivation. Thus poly will not be extended any more. 



B^ A 


B^ A 


n(x ) = m(xi) 


x = x 1 


Am(xo) = x 




Au(xi) = 2x 




Apoly(xo) = fc poly(xi) 




Av n(xi) = 




AO n(xi) = + 





Table 11: The second phase. 



We claim that the left rule appears log(n) times and the right rule 0(n) 
times in a derivation for input of size n. Obviously, the number poly is 
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0(2 klogt ) = 0(i k ) when the number size is i. 
B.3 From AVG to HP-AVG 

In this section we show how to transform an AVG into an AVG that satisfies the 
HPC (HP-AVG). Since all computation steps of the HP-AVG only require a lin- 
ear amount of derivation steps, total derivations of HP-AVGs have polynomial 
length. 

We can divide the attributes of the HP-AVG into two groups. The attributes 
that encode the counters, and the attributes of the original AVG. The former 
will be embedded under the attribute COUNTER, the latter under the attribute 
grammar. In the sequel, we mean by 0| grammar the formula (j) embedded 
under the attribute grammar, i.e., the formula obtained from <p by substituting 
the variables Xi by GRAMMAR(xj). 

The HP-AVG is obtained from the AVG in three steps: change the start 
nonterminal, the lexicon and the syntactic rules. First, the HP-AVG contains 
the rules of the previous section, which construct the counter. The nonterminal 



S from Table |i0| is the start nonterminal of the HP-AVG. For the nonterminal 

is 



A the start nonterminal of the AVG is taken. Nonterminal B from Table 11 
a fresh nonterminal, not occurring in the AVG. 

Second, the HP-AVG contains an extension of the lexicon of the AVG. The 
entries of the lexicon are extended in the following way. The size of the lexical 
form is set to one, and the amount of derivation steps is zero. Thus, if (w, X, (f>) 
is the lexicon of the AVG, then (w, X, ip) is the lexicon of the HP-AVG, where 

ip = 4> | GRAMMAR 

A V SIZE COUNTER(xo) = 1 

A 1 SIZE COUNTER(a;o) = + 

A POLY COUNTER(xo) = + 

Third, the HP-AVG contains extensions of the syntactic rules of the AVG. 
The syntactic rules are extended in the following way. The numbers POLY and 
SIZE of the daughter nonterminals are collected in the lists plist and slist. 
Both lists are summed. The number size of the mother nonterminal is equal 
to the sum of size's, and the number poly of the mother nonterminal is one 
more than the sum of poly's. Thus, if (Xq,Xi, . . . , X n , eft) is a syntactic rule of 
the AVG, then (Xq,X\, . . . , X n ,ip) is a syntactic rule of the HP-AVG, where 

ijj = cp\ GRAMMAR 

A SUMS counter(xo) = £ SLIST counter(x ) 

A SIZE COUNTER(xo) = sums counter(x ) 

A SUMP COUNTER(xo) = S PLIST COUNTER(xo) 

A sump counter(xo) = y 

A POLY counter(xo) =y + l 

A F R* SLIST COUNTER(xo) = SIZE COUNTER(xj) (0 < i < n) 

A R" SLIST COUNTER(xo) = + 
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A FR* PLIST COUNTER(xo) = POLY COUNTER^) (0 < % < n) 
A R n PLIST COUNTER(xo) = + 

Now, a derivation for the HP-AVG starts with a nondeterministic construc- 
tion of a counter size with value n and a counter poly with value 0{n k ). Then, 
the derivation of the original AVG is simulated, such that (i) the mother non- 
terminal produces a string of size n if, and only if the daughter nonterminals 
together produce a string of size n, and (ii) the mother nonterminal makes 
n k + 1 derivation steps if, and only if the daughter nonterminals together make 
n k derivation steps. 



